The science achievable with SKA Hi surveys will be greatly increased through the combination
of Hi data with that at other wavelengths. These multiwavelength datasets will enable studies to 
move beyond an understanding of Hi gas in isolation to instead understand Hi as an integral part 
of the highly complex baryonic processes that drive galaxy evolution. 

As they evolve, galaxies experience a host of environmental and feedback influences, many 
of which can radically impact their gas content. Important processes include: accretion (hot 
and cold mode, mergers), depletion (star formation, galactic winds, AGN), phase changes 
(ionised/atomic/molecular), and environmental effects (ram pressure stripping, tidal effects, 
strangulation). Governing all of these to various extents is the underlying dark matter distribution. 
In turn, the result of these processes can significantly alter the baryonic states in which material 
is finally observed (stellar populations, dust, chemistry) and its morphology (galaxy type, 
bulge/disk ratio, bars, warps, radial profile). To fully understand the evolution of Hi and the role 
it plays in galactic evolution requires the ability to quantify each of these separate processes, and 
hence to coordinate SKA Hi surveys with extensive multi-band photometric and spectroscopic 
campaigns. In addition, multiwavelength data is essential for statistical methods of Hi analysis 
such as Hi stacking and intensity mapping cross-correlations. 

In this chapter, we examine some of the principal science motivations for acquiring multiwavelength
data to match that from the extragalactic SKA Hi surveys, and review the currently planned
capacity to achieve this (e.g. LSST, Euclid, WFIRST, SPICA, ALMA, and 4MOST).

1. Introduction

Our knowledge of galaxies and their evolution has been dominated by studies of the late-stage 
products of the evolutionary cycle, with extensive surveys in the optical and nearby wavelengths 
tracing the evolution of stellar material across large fractions of cosmic time. In contrast, our 
knowledge of the Hi content of galaxies, the fundamental baryonic material out of which galaxies 
are made and a key tracer of their dynamics, remains rudimentary. Direct measurements of Hi 
content in individual galaxies beyond the local universe are currently limited to galaxy samples of 
only hundreds out to z~0.2-0.3 (BUDHIES, Verheijen et al. 2007; CHILES, Fernandez et al. 2013), 
compared to the multi-million object samples out to z~1 and beyond that will be possible with the
SKA (more than half the history of the Universe). Linking these groundbreaking Hi samples with 
matched multiwavelength tracers will provide transformational new datasets spanning all of the 
major galactic constituents, and provide unique insight into the evolution of galaxies across cosmic 
time. 

In this chapter, we explore the needs and availability of multiwavelength data for extragalactic 
Hi surveys with the SKA. Section 2 summarises some of the principal scientific drivers for ancillary
multiwavelength data and the analysis techniques these enable, followed by an overview of
upcoming multiwavelength survey facilities in Section 3. The implications for future SKA and
multiwavelength survey design are discussed in Section 4. Finally, conclusions are given in Section 5,
including a basic ranking of facilities and multiwavelength science products based on the
degree to which well-matched multiwavelength data will be possible for SKA Hi surveys, as known 
at the time of writing. 

2. Need to Match HI Surveys with Multiwavelength Data
2.1 Characterising Different Baryonic States 

Understanding the mass assembly of galaxies, and the evolution of material from one baryonic 
state to another, is at the heart of galaxy evolution studies. A large range of processes and scales 
are involved in this assembly, from the on-going gravitational collapse of gas into the filamentary 
structures of the cosmic web and its accretion into galactic halos, the cooling of gas into galactic 
disks and dense molecular clouds, to the subsequent star formation processes within these structures.
In this simple picture, galactic Hi acts as a bridge between the reservoir of gas in the cosmic
web and the stellar material in galaxies. However, the full picture of Hi content and its role in
galaxy evolution is likely much more complex, with a host of mechanisms that can variously act
to both add and remove Hi from galaxies. Potentially important processes here include: accretion
(hot and cold mode, mergers; e.g. Keres et al. 2005), depletion (star formation, galactic winds,
AGN; e.g. Power et al. 2010; Kim et al. 2013), phase changes (ionised/atomic/molecular; e.g. Hopkins
et al. 2008), and environmental effects (e.g. Gunn & Gott 1972; Moore et al. 1996). Each
of these sources and sinks ideally needs to be quantified through multiwavelength observations to 
determine the relative importance of each, and the nature of the fundamental scaling relations that 
link them (e.g. the Schmidt law, Schmidt 1959; Table 1 provides a summary of some of the central
multiwavelength tracers and the physical processes and galactic components they measure). The
SKA will provide an unprecedented window through which to study the role of Hi in these relations
across a large fraction of the history of the Universe, and will do so on both a global and
morphologically resolved basis.

Property                  Tracers
------------------------  ----------------------------------------------------------------------
atomic gas mass           HI
molecular gas mass        mm emission lines (e.g. CO)
stellar mass              optical/NIR multiband photometry (e.g. K, g-i), optical spectroscopy
dust mass/temperature     mid/far-IR multiband photometry, (sub)mm continuum
morphology                panchromatic high resolution imaging, fitted structural parameters and
                          decomposition (bulge, disk, bar)
star formation rate       Hα, UV, radio continuum, mid-IR
supermassive black hole   optical emission line diagnostics, radio continuum
chemistry                 optical/submm/mm spectroscopy via emission/absorption lines (high R)
dynamics                  line fitting (optical, near-IR, CO, HI), spatially resolved (optical IFU,
                          CO, HI)
environment               redshift surveys (low R) + group catalogues (membership, multiplicity,
                          halo mass, central/satellite classification), photo-z (multiband
                          optical/NIR, large scale structure), X-rays (clusters, hot groups,
                          massive halos), radio SZ (clusters, high-z)
distance                  redshift (Hubble flow), Tully-Fisher (HI+optical/NIR imaging),
                          Faber-Jackson/Fundamental Plane (optical/NIR imaging and spectroscopy)

Table 1: Physical quantities and their multiwavelength tracers


2.2 Characterising Galaxy Dynamics 

Along with mass, other key properties in the evolution of galaxies are their kinematics and angular
momenta. Hi observations with the SKA will be a powerful addition to this field, with SKA1
alone expected to yield two orders of magnitude more good Hi kinematic maps than the leading existing
datasets (Obreschkow et al. 2014), and more than a million galaxies with spatially unresolved
Hi profiles. While a powerful diagnostic in their own right, with Hi gas extending further into the
dark matter haloes of galaxies than can typically be traced by stellar emission, the potential of these
data will be best exploited through their combination with tracers at other wavelengths. At the most
basic level, higher resolution optical photometry enables inclination-corrected rotation velocities
to be extracted from line-of-sight global Hi profiles, along with improved morphological parameters
such as disk scale lengths. At a more detailed level, matched optical/NIR spectroscopy and
IFU data provide complementary measurements of the stellar-phase and ionized gas-phase material,
which combined with Hi data will allow the full kinematic and angular momentum properties
of galaxies to be studied, linking the often different evolutionary information encoded in the kinematics
of gas and stars, and indeed tracing the dynamics of different morphological features such
as disks and bulges. Through both joint and separate analyses, Hi and multiwavelength data will
enable the full suite of galactic dynamical scaling relations to be examined (e.g. Tully & Fisher
1977; Faber & Jackson 1976; $M_*$-$S_{0.5}$ relation, Kassin et al. 2007), and their evolution, underlying
physics, and observational biases to be understood.
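
As a rough illustration of the inclination correction mentioned above, the sketch below converts a global Hi profile width into a rotation velocity, using an inclination estimated from an optical axis ratio. The oblate-disk formula, the intrinsic thickness q0 = 0.2 and the function names are assumptions made for illustration only; they are not prescribed by this chapter, and corrections for turbulent broadening and instrumental resolution are ignored.

```python
import math


def inclination_from_axis_ratio(q, q0=0.2):
    """Inclination in radians from an optical axis ratio q = b/a.

    Assumes an oblate disk with intrinsic thickness q0 (0.2 is an assumed,
    commonly adopted value): cos^2(i) = (q^2 - q0^2) / (1 - q0^2).
    """
    if not 0.0 < q <= 1.0:
        raise ValueError("axis ratio must lie in (0, 1]")
    if q <= q0:
        return math.pi / 2.0  # flatter than the assumed thickness: treat as edge-on
    cos2_i = (q ** 2 - q0 ** 2) / (1.0 - q0 ** 2)
    return math.acos(math.sqrt(cos2_i))


def rotation_velocity(line_width_kms, q, q0=0.2):
    """Inclination-corrected rotation velocity (km/s) from a global profile width."""
    sin_i = math.sin(inclination_from_axis_ratio(q, q0))
    if sin_i < 1e-6:
        raise ValueError("galaxy is face-on; the correction is undefined")
    return line_width_kms / (2.0 * sin_i)
```

The correction diverges for face-on systems, which is one reason samples used for Tully-Fisher style analyses are usually restricted in inclination.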
2.3 Characterising Environment

Some of the most important external effects driving the distribution of galaxy properties we 
observe today are the environmental factors that can drastically impact Hi content. For instance, 
the suppression of star formation in cluster environments is not due to an alteration in the actual 
processes of star formation, but rather due to a quenching of the Hi supply (tidal influences, Moore 
et al. 1996; ram pressure stripping, Gunn & Gott 1972; strangulation, Balogh et al. 2000). However, 
the detailed processes by which environment affects Hi remain only partially understood and will
be an important area of research for the SKA. In the case of star formation quenching, for example, 
major optical surveys such as 2dFGRS find that star formation is suppressed at large distances from 
clusters, implying that evolutionary processes must be at work beyond these environments and the 
simple ram-pressure stripping of Hi gas in their cores (Lewis et al. 2002). Existing studies also 
find contradictory results on the environmental dependence of the Hi mass function (Zwaan et al. 
2005; Springob et al. 2005). 

A potential contributor to some of the observational uncertainties and disagreements in existing
environmental effect results is the difficulty of properly characterising environment, together
with the variety of different methods that have been used (e.g. Hi vs. optical metrics). From
an observational perspective, overcoming these challenges requires the combination of Hi data 
with deep, well-sampled optical redshift samples to provide a full characterisation of environment 
(group membership, group multiplicity, parent halo mass, central/satellite status). Together, these 
datasets will sample a greater dynamic range of environment than is possible from examining the 
distribution of Hi-rich galaxies alone, which are strongly anti-biased in the local Universe. Direct
tracers of the intra-group/cluster medium, such as those provided by X-ray data, will also be a highly
valuable addition to the group catalogues provided by redshift surveys. Interferometric Hi observations
with sufficient resolution to isolate emission from galaxies in close proximity will also be
needed to mitigate confusion effects in these environments, a requirement that will be met over a 
large redshift range by the SKA. 

2.4 Characterising Feedback 

The feedback effects from local processes such as star formation and AGN activity can be 
just as important as those on larger scales. For instance, recent simulation results find that
feedback from supernovae impacts the overall normalisation of the Hi mass function, while AGN 
feedback can impact its high mass slope, with increased activity decreasing the number of high Hi 
mass galaxies observed (Kim et al. 2013). The strength of these feedback mechanisms is also found 
to have an inverse effect on the global clustering strength of the Hi-rich galaxy population. Importantly,
the various feedback mechanisms are also found to imprint differently on the distribution
functions of the Hi-rich galaxy population compared to that observed at other wavelengths, offering
an improved diagnostic ability for understanding the role of feedback in galaxy evolution when
Hi datasets are combined with those selected, for instance, in the optical. Nevertheless, a great
deal of progress remains to be made in this area, and favoured models still offer sometimes widely 
discrepant results with observations (Duffy et al. 2012). Large-scale Hi surveys with the SKA, 
combined with matching multiwavelength data, will be essential for resolving these uncertainties, 
tracing the causes and varying effects of feedback across all of the major galactic constituents. 
2.5 Enabling Alternative Analysis Techniques

Finally, multiwavelength data can also offer new ways of analysing Hi datasets. Notably, 
the combination of Hi data with independent optical redshift catalogues has been used to vastly 
increase the redshift range over which measurements of the cosmic Hi density and gas fraction 
scaling dependencies can be made. In one technique, Hi stacking, the rest frame Hi spectra of a
large number of sources - extracted from an Hi datacube and shifted to rest frame on the basis of
their known optical positions and redshifts - are combined to obtain a statistically significant average
spectrum for the input galaxy sample. This method has now been used to successfully obtain
high redshift determinations for $\Omega_{\rm HI}$ in both single-dish and interferometric observations (Lah et al.
2007; Delhaize et al. 2013; Rhee et al. 2013). The technique has also been demonstrated for absorption
line studies (e.g. Gereb et al. 2013, 2014). Another method is intensity mapping, which rather
than trying to stack the emission from individual sources and correct for any confusion effects that
may arise, instead analyses the intensity field of emission as a whole, through auto-correlation or
cross-correlation of the Hi intensity field with that derived from an optical spectroscopic catalogue.
In the last decade, great strides have been made in the application of this technique to measure the
global Hi content of the Universe (Pen et al. 2009; Chang et al. 2010). Lastly, independent optical
redshifts can be used to measure the Hi properties of individual sources at a much lower significance
level than would be possible from the blind Hi data alone, significantly increasing sample
sizes for galaxy-by-galaxy analyses.
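
The stacking step described above can be sketched in a few lines. This is a minimal illustration under simplifying assumptions of our own: all spectra share one observed-frame frequency axis, the stack is an unweighted mean, and no correction is made for confusion, primary beam or cosmological dimming. The function and variable names are hypothetical and do not correspond to any of the cited analyses.

```python
import numpy as np

HI_REST_MHZ = 1420.405751768  # rest frequency of the 21 cm line


def stack_spectra(freq_mhz, spectra, redshifts, rest_grid_mhz):
    """Mean-stack spectra after shifting each one to the rest frame.

    freq_mhz      : observed-frame frequency axis (ascending), shared by all spectra
    spectra       : array of shape (n_gal, n_chan), one extracted spectrum per galaxy
    redshifts     : optical redshift of each galaxy
    rest_grid_mhz : common rest-frame frequency grid for the stack
    """
    freq_mhz = np.asarray(freq_mhz, dtype=float)
    shifted = []
    for spectrum, z in zip(spectra, redshifts):
        rest_freq = freq_mhz * (1.0 + z)  # observed -> rest frame
        # Resample onto the common grid; channels outside the band become NaN.
        shifted.append(
            np.interp(rest_grid_mhz, rest_freq, spectrum, left=np.nan, right=np.nan)
        )
    # Unweighted mean, ignoring channels a given galaxy does not cover.
    return np.nanmean(np.array(shifted), axis=0)
```

For uncorrelated noise the rms of the stacked spectrum falls roughly as the square root of the number of galaxies, which is what allows an average signal to be recovered from individually undetected sources. In practice a weighted mean (for example by inverse noise variance) is often preferred.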

3. Multiwavelength Facilities & Surveys 

In this section, we review current and planned multiwavelength survey facilities, examining 
these by survey class (imaging, spectroscopic, IFU) and assessing their relative performance as a 
function of wavelength to identify those best suited to provide matching multiwavelength data for 
the strawman SKA Hi surveys. Note that we have not differentiated between facilities with open 
data access policies and those with restricted proprietary periods. However, over the lifetime of the 
SKA it is expected that all of these datasets will become publicly available, as is normal astronomy 
practice. 

Given inevitable variations in survey design, there is not a single figure of merit (FoM) that 
covers all scientific purposes. For instance, for spectroscopic surveys an experiment designed 
to cover a huge area of sky at low redshift will need a large field of view but not necessarily
huge multiplexing or collecting area given the distribution of sources on the sky. Equally, a
high redshift study might need good multiplexing and a large collecting area but not a huge field 
of view. To allow for these design variations in desired FoMs we have designed an interactive 
website where users can build their own FoM, amongst other interactive options. The aim is to 
keep this database up-to-date with a moving 10-year window. The current host location is at: 
https://asgr.shinyapps.io/ganttshiny/. 
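
To make the idea of a user-built FoM concrete, the sketch below evaluates the imaging metric used in Figure 1 (telescope area times field of view, divided by the square of the image FWHM) and scales each facility to the best in the list. The facility names and parameter values are placeholders invented for illustration; they are not taken from the Appendix tables or from the online tool.

```python
def imaging_fom(area_m2, fov_deg2, fwhm_arcsec):
    """Imaging figure of merit: Area * FoV / FWHM^2."""
    return area_m2 * fov_deg2 / fwhm_arcsec ** 2


def relative_fom(facilities):
    """Scale each facility's FoM to the best one in the supplied set."""
    foms = {name: imaging_fom(*params) for name, params in facilities.items()}
    best = max(foms.values())
    return {name: fom / best for name, fom in foms.items()}


# Hypothetical (collecting area in m^2, field of view in deg^2, FWHM in arcsec).
example_facilities = {
    "Facility A": (50.0, 1.0, 1.0),
    "Facility B": (35.0, 9.6, 0.7),
    "Facility C": (10.0, 3.0, 1.2),
}

relative = relative_fom(example_facilities)
for name, value in sorted(relative.items(), key=lambda item: -item[1]):
    print(f"{name}: {value:.3f} of best")
```

A different science case would simply swap in a different function in place of `imaging_fom`, for example one that multiplies by the number of fibres for a spectroscopic survey.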

3.1 Optical/near-IR imaging surveys 

From a multiwavelength standpoint, imaging surveys represent the base from which other, 
more detailed follow-up observations can follow independently of the Hi selected dataset, and are
therefore a crucial starting point. The survey imaging needs to be competitive otherwise all that
follows will be similarly limited.

[Figure 1: four Gantt-chart panels (imaging, spectroscopic, IFU and radio extra-galactic survey
facilities), each spanning 2014-2024, with facilities grouped by waveband and shaded by FoM relative
to the best facility in that waveband.]

Figure 1: (top-left) Comparison of various extra-galactic imaging survey facilities that will operate between
now and 2025. Figure of merit (FoM) is calculated using $\mathrm{Area} \cdot \mathrm{FoV} / \mathrm{FWHM}^2$. (top-right) Comparison
of various extra-galactic spectroscopic survey facilities that will operate between now and 2025. FoM is
calculated using $\mathrm{Area} \cdot \mathrm{FoV} \cdot N_{\rm fib} / \mathrm{FWHM}^2$. (bottom-left) Comparison of various extra-galactic IFU survey
facilities that will operate between now and 2025. FoM is calculated using $\mathrm{Area} \cdot \mathrm{FoV} \cdot N_{\rm fib} / \mathrm{FWHM}^2$.
(bottom-right) Comparison of various extra-galactic radio survey facilities that will operate between now
and 2025. FoM is calculated using $(\mathrm{Area}/T)^2 \cdot \mathrm{FoV}$. 'Hi' corresponds to facilities able to observe Hi in
the local Universe, i.e. they can observe at frequencies as high as 1.4 GHz. Note that in all panels FoM
shading is scaled within a wavelength subset. The accompanying data for all charts is given in the Appendix.
Versions of these charts and table data can be generated online using an interactive tool located at
https://asgr.shinyapps.io/ganttshiny.

There are a significant number of imaging facilities that have recently come online or will 
come online in the next decade. The top-left panel of Figure 1 is a Gantt-Chart representation of 
when these facilities might become available, and how ‘fast’ they are at surveying the extra-galactic 
sky. The chosen metric provides an insight into depth (telescope area) and sky coverage (FoV) per 
unit time, and data quality (FWHM image resolution). The majority of facilities will operate in the
optical and there are two clear front-runners: Subaru Hyper Suprime-Cam, and towards the end of
the decade, LSST. In the near-IR, arguably the more obvious band for sample selection as it most
closely follows stellar mass, the obvious winner is VISTA, moving to Euclid towards the end of the
decade.

The SKA will be largely accommodated by the huge Southern sky survey area and depth of 
LSST (at least in the optical). For this reason we have confidence that a large fraction of the 
imaging side of the SKA multiwavelength data will be either in place, or appearing, by the time 
SKA begins operations. This is one part of the multiwavelength equation the SKA will not need to 
worry about going forward, but it should be noted that near-IR surveys will be less well matched 
in both depth and area. 

3.2 Massively multiplexed spectroscopic surveys 

Moving into the next generation of surveys, there will need to be a natural synergy with optical
and NIR spectroscopic campaigns. A number of multi-object spectroscopic (MOS) facilities are
currently available, and more are scheduled to come online between 2015 and 2025. Figure 1
(top-right) is a Gantt-Chart representation of when these facilities might become available, and how
fast they are at spectroscopically surveying extra-galactic objects.

One of the fastest MOS facilities available is AAT/AAOmega. This has been used to conduct a
number of extra-galactic surveys, each of a few hundred thousand objects: 2dFGRS, WiggleZ and GAMA.
Looking forward, there are a large number of next generation MOS facilities scheduled to start
operating in the Northern hemisphere between 2017 and 2019, however only VISTA/4MOST
will be operating in the South (i.e. overlapping substantially with ASKAP, MeerKAT and SKA).
For this reason alone it would be a compelling next generation MOS facility for the SKA to become
involved in. It is worth noting that these next-generation MOS facilities are all comparably fast,
and the reality is the faster extra-galactic surveys will be conducted on the most dedicated facilities.

3.3 Spatially resolved spectroscopic surveys 

An equally important direction for next generation survey science is spatially resolved spectroscopy
through the development of novel and pioneering instrumentation (e.g. AAT/SAMI,
VLT/MUSE, VLT/KMOS). A key power of these instruments is their ability to spatially resolve the
chemistry and dynamics of galaxies - star-formation, metallicity, stellar population ages, angular
momenta and stellar dispersions. Current spatially resolved surveys are at the thousand galaxy mark
(the Spain-led CALIFA, the Australian-led SAMI and the US-led MaNGA surveys). ATLAS^3D
has also compiled a large sample of IFU and Hi measurements for early-type galaxies with WHT
and WSRT.

The AAT/SAMI facility is the only multi-object IFU facility currently available, but others
are scheduled to come online between 2015 and 2025. Figure 1 (bottom-left) is a Gantt-Chart representation
of when these facilities might become available and how fast they are at surveying
extra-galactic objects with IFUs. Simultaneously probing gas and stellar kinematics through the
combination of SKA and optical/NIR IFU data will be a huge science lever in the near future. Beyond
this, high S/N chemistry can provide great insight into the conversion of Hi and molecular
gas into stars and metals, but a full understanding will require large numbers of galaxies in huge 
spatial and multiwavelength detail. 
4. Optimal Matching of HI and Multiwavelength Datasets

The most basic requirement for maximising the scientific return from SKA Hi surveys is to
ensure that multiwavelength survey programs are carried out in the same regions of sky as those
proposed for Hi. Beyond that, there are clearly pragmatic considerations regarding the depth and
area of such overlapping multiwavelength surveys, with consideration needed for likely source
densities, survey times and expense. We perform a basic assessment of the optimal required depth
for multiwavelength data as a function of SKA survey area and integration time by analysing simulated
data from S^3-SAX (Obreschkow et al. 2009; Obreschkow & Meyer 2014). These data span
the redshift range 0 < z < 1.2 in a 100 deg^2 volume-limited light-cone containing galaxy cold gas
masses, Hi and H_2 masses, stellar masses, luminosities, and apparent magnitudes. We apply apparent
Hi, r-band and CO detection thresholds to assess the degree of overlap between a range of
simulated observed galaxy datasets. For the detection of Hi sources by the SKA in this analysis, a
$5\sigma$ (optimal) threshold is applied. A synthesised beam size of 5" is used for all surveys, with the
point source signal-to-noise value of galaxies being adjusted according to their size relative to the
beam ($\mathrm{SNR} \propto 1/\sqrt{1 + A_{\rm galaxy}/A_{\rm beam}}$).
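
The beam-size adjustment can be written out explicitly. The sketch below assumes the scaling takes the form SNR ∝ 1/sqrt(1 + A_galaxy/A_beam), and additionally assumes a circular Gaussian beam whose area is computed from its FWHM; the Gaussian-beam area and the function names are our own illustrative choices, not details given in the text.

```python
import math


def gaussian_beam_area(fwhm_arcsec):
    """Area (arcsec^2) of a circular Gaussian beam of the given FWHM (assumed shape)."""
    return math.pi * fwhm_arcsec ** 2 / (4.0 * math.log(2.0))


def resolved_snr(point_source_snr, galaxy_area_arcsec2, beam_fwhm_arcsec=5.0):
    """Scale a point-source SNR for a galaxy that is extended relative to the beam."""
    beam_area = gaussian_beam_area(beam_fwhm_arcsec)
    return point_source_snr / math.sqrt(1.0 + galaxy_area_arcsec2 / beam_area)


def is_detected(point_source_snr, galaxy_area_arcsec2, threshold=5.0):
    """Apply a 5-sigma style detection threshold after the size adjustment."""
    return resolved_snr(point_source_snr, galaxy_area_arcsec2) >= threshold
```

A galaxy much smaller than the beam keeps essentially its point-source SNR, while one covering three beam areas has its SNR halved.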

The first metric we consider is the simple fraction of Hi sources detected in a simulated Hi
SKA survey (sample A) that will have a counterpart in an r-magnitude limited dataset (sample
B): $F = |A \cap B| / |A|$. An r-band selection is used for redshift surveys such as GAMA due to the
presence of Hα (for higher redshift samples a longer observed frame wavelength selection may be preferable).
The results of this analysis are shown in Figure 2, with separate panels given for SKA1 (50%),
SKA1 and SKA2. We also include a plot for the match between SKA1 and a CO(1-0) flux limited
dataset. For SKA1 and its 50% early deployment option, four separate y-axes are included for the
four nominal reference surveys considered by the Hi Science Assessment Workshop (September
2013): 3 deg^2 (SKA1-MID and 50% option), 300 deg^2 (SKA1-SUR and 50% option), 3000 deg^2
(SKA1-SUR and 50% option) and 30000 deg^2 (SKA1-SUR and 50% option). For SKA2, three
survey areas are examined: 60 deg^2, 600 deg^2 and 6000 deg^2. A horizontal dashed line indicates
the reference integration times for each survey: 2000 hours for the 3 deg^2 survey with SKA1-MID
and the 60-6000 deg^2 SKA2 surveys, and 2 years for the 300-30000 deg^2 surveys with SKA1-SUR.
The vertical intersect for each dashed survey line then corresponds to the required r-band limit for
90% of the Hi sample to have an optical counterpart.

Figure 2: (top-left, top-right, bottom-left) Fraction of Hi SKA survey sources (SKA1 50%, SKA1, SKA2
respectively; as a function of integration time) that are detected in an r-band apparent magnitude-limited
sample (down to a given magnitude). The colour-scale indicates the value of the metric $F = |A \cap B| / |A|$.
Different y-axes give the integration time scales for different survey areas. For SKA1 50% and SKA1, four
surveys are considered: 3 deg^2 using SKA1-MID (and 50%; black), 300 deg^2 using SKA1-SUR (and 50%; red),
3000 deg^2 using SKA1-SUR (and 50%; green), and 30000 deg^2 using SKA1-SUR (and 50%; blue). For SKA2,
three surveys are considered: 60 deg^2 (black), 600 deg^2 (red), and 6000 deg^2 (green). Dotted lines indicate
the r-magnitude limit needed to achieve matches for 90% of sources in each SKA survey area (given 2000
hrs of integration for SKA1-MID & SKA2 surveys, and 2 years of telescope time for SKA1-SUR surveys).
(bottom-right) Fraction of Hi SKA1 survey sources that are detected in a CO(1-0) flux sensitivity limited
sample, with colour-scale, colour-axes and indication lines as before.

While achieving high completeness (well-suited e.g. for tasks such as optically motivated
source finding, or an indicative limit for where Hi stacking gains interest), the above identified
r-band samples will be comparatively inefficient when viewed from an optical standpoint, containing
many sources that will have no Hi counterpart. To assess the optimal r-band (and CO) depth at
which the mutual overlap between Hi and multiwavelength samples is maximised, we consider a
second metric $E = |A \cap B| / |A \cup B|$, where a value of 0 indicates that the two selected populations
are disjoint, and a value of 1 indicates the selected populations are identical. The results of this
analysis are shown in Figure 3, with panels arranged as for Figure 2. We note that the S^3-SAX
simulations used become incomplete for stellar/gas masses below $\sim 10^8\,M_\odot$, a mass limit that may
lead to an overestimate of survey overlap in the top-right portions of these diagrams compared to
the full-mass distribution of galaxies in the Universe. The dashed lines now indicate the r-band
and CO flux thresholds corresponding to the optimal identified limits where the mutual overlap
between the Hi and ancillary datasets is largest. The redshift and stellar mass distributions for the
resultant Hi and r-band limited samples are shown in Figure 4.
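
The two overlap metrics are straightforward to compute once each selection is expressed as a set of galaxy identifiers. The sketch below assumes F is the fraction of the Hi sample (A) with a counterpart in the comparison sample (B), and that E is the intersection-over-union of the two samples, which is consistent with the stated limits of 0 for disjoint and 1 for identical populations. The function names and the toy identifiers are hypothetical.

```python
def match_fraction(hi_ids, other_ids):
    """F: fraction of the Hi-selected sample A with a counterpart in sample B."""
    a, b = set(hi_ids), set(other_ids)
    if not a:
        raise ValueError("the Hi sample is empty")
    return len(a & b) / len(a)


def mutual_overlap(hi_ids, other_ids):
    """E: size of the intersection over the size of the union (assumed form)."""
    a, b = set(hi_ids), set(other_ids)
    union = a | b
    if not union:
        raise ValueError("both samples are empty")
    return len(a & b) / len(union)


# Toy example: a deeper optical sample recovers every Hi source (F rises)
# but adds many galaxies with no Hi detection (E falls).
hi_sample = {1, 2, 3, 4}
shallow_optical = {3, 4, 5}
deep_optical = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}

print(match_fraction(hi_sample, shallow_optical), mutual_overlap(hi_sample, shallow_optical))
print(match_fraction(hi_sample, deep_optical), mutual_overlap(hi_sample, deep_optical))
```

This is why the depth that maximises E is generally shallower than the depth needed to reach a 90% match fraction in F.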

[Figure 3 axis label, CO panel: CO flux sensitivity (mJy per 8 GHz)]

Figure 3: (top-left, top-right, bottom-left) Degree of mutual overlap between the Hi sources detected by
SKA telescopes (SKA1 50%, SKA1, SKA2 respectively; as a function of integration time) and an optically
selected r-band sample (down to a given magnitude). The colour-scale indicates the value of the metric
$E = |A \cap B| / |A \cup B|$, where E = 0 indicates that the two selected populations are disjoint, and E = 1
indicates the selected populations are identical. Different y-axes give the integration time scales for
different survey areas. For SKA1 50% and SKA1, four surveys are considered: 3 deg^2 using SKA1-MID (and 50%;
black), 300 deg^2 using SKA1-SUR (and 50%; red), 3000 deg^2 using SKA1-SUR (and 50%; green), and 30000
deg^2 using SKA1-SUR (and 50%; blue). For SKA2, three surveys are considered: 60 deg^2 (black), 600
deg^2 (red), and 6000 deg^2 (green). Dotted lines show the optimal r-magnitude matching limit for each SKA
survey area (given 2000 hrs of integration for SKA1-MID & SKA2 surveys, and 2 years of telescope time for
SKA1-SUR surveys). (bottom-right) Degree of mutual overlap between the Hi sources detected by SKA1
and a CO(1-0) flux sensitivity limited sample, with colour-scale, colour-axes and indication lines as before.

Figure 4: Stellar mass and redshift distributions of the optimal Hi and r-band limited samples as identified
by the dashed lines in Figure 3 for SKA1 (50%; top-left), SKA1 (top-right) and SKA2 (bottom). Redshift
distributions reflect the full number counts for each Hi and r-band limited survey, while one thousandth of
each sample is plotted in the stellar mass distributions. Hi survey integration times are: 2000 hrs (SKA1-MID,
SKA1-MID 50%: 3 deg^2; SKA2: 60, 600 and 6,000 deg^2); and 2 years (SKA1-SUR, SKA1-SUR
50%: 300, 3000 and 30,000 deg^2).

From these plots it can be seen that the optical imaging depth requirements for all the examined
SKA surveys (up to, and including the full SKA2) will be well met by surveys such as those
proposed for LSST. Only the very deepest small-area Hi surveys may wish to coordinate with the
LSST deep fields rather than the main survey (r ~ 28 mag vs. ~26.5 mag). Black/white arrows
are also included in the r-band plots to indicate the depths of some existing and planned low-SNR
optical redshift surveys (SDSS, GAMA, WAVES). These show that the optimal redshift sample
depth requirements for SKA1-SUR and SKA1-SUR 50% Hi surveys will be met by programs
such as the WAVES design-reference survey on 4MOST (within its proposed < 1000 deg^2 area).
However, for SKA2, suitably matched redshift samples will require the development of a new class
of multiplexed spectroscopic survey facility compared to the current generation of existing (and
planned) 4m telescopes. From the black arrows in the CO plots, showing example blind surveys
that could be achieved with ALMA, it can be seen that the vast difference in survey speeds between
ALMA and the SKA means that ALMA observations will largely be restricted to targeting Hi
sub-samples.

5. Conclusion 

The SKA offers a unique opportunity to understand the evolution of galaxies through large- 
scale surveys of the most fundamental baryonic building block of the Universe: Hi. The scientific 
opportunity provided by this capacity will be maximised by linking these data to that at other wavelengths,
shedding light on the host of regulatory processes that govern the transfer and conversion
of material from one baryonic state to another. At present, the planned capability for the acquisition
of appropriate multiwavelength data for extragalactic Hi surveys varies significantly as a
function of frequency and required survey area for SKA1. Excellent optical/NIR imaging will be
available through facilities such as LSST, Euclid and WFIRST (indeed also extending to depths
suitable for SKA2), subject to any issues that may arise through a need to coordinate target areas,
or from restricted access to proprietary data. However, there is a comparative
lack of planned capacity to carry out complementary imaging surveys at other wavelengths.
This will restrict the ability to trace important physical quantities such as star formation and dust
content. Molecular line observations are similarly constrained, with only targeted follow-up observations
currently possible over larger areas with facilities such as ALMA, rather than blind surveys
of matching regions. The landscape for optical spectroscopy is significantly better, with suitable
surveys expected to be possible with facilities such as VISTA/4MOST, Subaru/PFS and Mayall/DESI.
However, the sensitivity of these facilities will likely only be good enough to provide redshift
information, and not the S/N of 10 or more needed to trace chemistry, which will require the
development of dedicated 10+m spectroscopic survey facilities. The availability of these various
required multiwavelength data products can loosely be broken down into five categories as summarised
below.

Facility grades: 

1. Planned capacity exists, and blind ancillary survey data is expected to exist without coordination
of target areas: optical imaging (LSST).

2. Planned capacity exists, and blind ancillary survey data could exist with coordination of
target areas: NIR imaging (Euclid: < 8000 deg^2, WFIRST); redshift emission spectra
(S/N ~ 2).

3. Planned capacity exists, and fully sampled ancillary data could exist, but only in a targeted
follow-up mode of worthwhile sources: optical IFU; gas-phase emission spectra (S/N ~ 10).

4. Planned capacity exists, and significantly sampled (~ 1-10%) ancillary data could exist, but
only in a targeted follow-up mode: mm (ALMA), FIR imaging (SPICA).

5. Little capacity is expected to be available in the coming decade, and only poorly sampled
(< 1%) ancillary data are possible: UV imaging; stellar-phase absorption spectra (S/N ~
100+).
(2013) which contains a good summary of radio facilities. For a more detailed comparison of radio telescope performance characteristics beyond the first order
approach used here, the reader is referred to Popping et al. (2014) in this volume. 
Table 4: Table comparing various extra-galactic IFU survey facilities that will operate between now and 2025.
Table 5: Table comparing various extra-galactic imaging survey facilities that will operate between now and 2025.



